Machine Learning: MNIST with TensorFlow

This notebook is based on the notebook published at https://github.com/random-forests/tutorials/blob/master/ep7.ipynb. I have simply adapted it to the current TensorFlow version (1.0.0).


In [1]:
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
import tensorflow as tf
learn = tf.contrib.learn
tf.logging.set_verbosity(tf.logging.ERROR)

Import the dataset


In [2]:
mnist = learn.datasets.load_dataset('mnist')
data = mnist.train.images
labels = np.asarray(mnist.train.labels, dtype=np.int32)
test_data = mnist.test.images
test_labels = np.asarray(mnist.test.labels, dtype=np.int32)


Extracting MNIST-data/train-images-idx3-ubyte.gz
Extracting MNIST-data/train-labels-idx1-ubyte.gz
Extracting MNIST-data/t10k-images-idx3-ubyte.gz
Extracting MNIST-data/t10k-labels-idx1-ubyte.gz

There are 55k examples in the training set and 10k in the test set. You may wish to limit the training size to experiment faster.
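
These sizes can be confirmed directly from the array shapes (a quick sketch; the comments assume the standard MNIST split):

print(data.shape)         # (55000, 784)
print(labels.shape)       # (55000,)
print(test_data.shape)    # (10000, 784)
print(test_labels.shape)  # (10000,)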


In [3]:
max_examples = 10000
data = data[:max_examples]
labels = labels[:max_examples]

Display some digits


In [4]:
def display(i):
    # Show test image i with its index and label as the title.
    img = test_data[i]
    plt.title('Example %d. Label: %d' % (i, test_labels[i]))
    plt.imshow(img.reshape((28,28)), cmap=plt.cm.gray_r)

In [5]:
display(0)



In [6]:
display(1)


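To look at several examples at once, here is a small grid helper built from the same logic as display (a sketch; the grid size is an arbitrary choice):

def display_grid(indices, cols=5):
    # Show each requested test image in a grid, titled with its label.
    rows = (len(indices) + cols - 1) // cols
    f, axes = plt.subplots(rows, cols, figsize=(2 * cols, 2 * rows))
    for ax, i in zip(axes.reshape(-1), indices):
        ax.imshow(test_data[i].reshape((28, 28)), cmap=plt.cm.gray_r)
        ax.set_title('Label: %d' % test_labels[i])
        ax.set_xticks(())
        ax.set_yticks(())
    plt.show()

display_grid(range(10))
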
These digits are clearly drawn. Here's one that's not.


In [7]:
display(8)


Now let's take a look at how many features we have. Each 28x28 image is flattened into a vector of pixel values, so we expect 784.


In [8]:
print(len(data[0]))


784

Fit a Linear Classifier

Our goal here is to get about 90% accuracy with this simple classifier. For more details on how these work, see https://www.tensorflow.org/versions/r0.10/tutorials/mnist/beginners/index.html#mnist-for-ml-beginners


In [9]:
feature_columns = learn.infer_real_valued_columns_from_input(data)
classifier = learn.LinearClassifier(feature_columns=feature_columns, n_classes=10)
classifier.fit(data, labels, batch_size=100, steps=1000)


/usr/local/lib/python2.7/dist-packages/tensorflow/python/util/deprecation.py:247: FutureWarning: comparison to `None` will result in an elementwise object comparison in the future.
  equality = a == b
Out[9]:
LinearClassifier(params={'gradient_clip_norm': None, 'head': <tensorflow.contrib.learn.python.learn.estimators.head._MultiClassHead object at 0x7f90fec1f610>, 'joint_weights': False, 'optimizer': None, 'feature_columns': [_RealValuedColumn(column_name='', dimension=784, default_value=None, dtype=tf.float32, normalizer=None)]})
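
Under the hood, LinearClassifier with n_classes=10 is essentially softmax regression: y = softmax(Wx + b). For intuition, here is a minimal hand-rolled equivalent in plain TensorFlow 1.x. This is a sketch, not the estimator's actual implementation; the learning rate and step count are arbitrary choices.

x = tf.placeholder(tf.float32, [None, 784])
y_ = tf.placeholder(tf.int32, [None])
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
logits = tf.matmul(x, W) + b
loss = tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(labels=y_, logits=logits))
train_step = tf.train.GradientDescentOptimizer(0.5).minimize(loss)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(1000):
        # Sample a batch of 100 examples, mirroring batch_size=100 above.
        batch = np.random.choice(len(data), 100)
        sess.run(train_step, {x: data[batch], y_: labels[batch]})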

Evaluate accuracy


In [10]:
evaluation = classifier.evaluate(test_data, test_labels)
print(evaluation["accuracy"])


0.9137
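
The same number can be recomputed by hand from the model's predictions (a sketch; in this TensorFlow version predict returns an iterable of class ids by default):

predictions = np.array(list(classifier.predict(x=test_data)))
print((predictions == test_labels).mean())  # should match the accuracy above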

Classify a few examples

We can make predictions on individual images using the predict method.


In [11]:
# here's one it gets right
idx = [0]
predictions = classifier.predict(x=np.array(test_data[idx]))
for i, p in enumerate(predictions):
    print("Predicted %d, Label: %d" % (p, test_labels[idx[i]]))
    display(idx[i])


Predicted 7, Label: 7

In [12]:
# here's one it gets wrong
idx = [8]
predictions = classifier.predict(x=np.array(test_data[idx]))
for i, p in enumerate(predictions):
    print("Predicted %d, Label: %d" % (p, test_labels[idx[i]]))
    display(idx[i])


Predicted 6, Label: 5
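
The same pattern extends to whole batches; here is a quick sketch that scans the first 20 test images and reports any misclassifications:

predictions = classifier.predict(x=test_data[:20])
for i, p in enumerate(predictions):
    if p != test_labels[i]:
        print("Index %d: predicted %d, label: %d" % (i, p, test_labels[i]))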

Visualize learned weights

Let's see if we can reproduce the pictures of the weights from the TensorFlow Basic MNIST tutorial.


In [13]:
weights = classifier.weights_
f, axes = plt.subplots(2, 5, figsize=(10,4))
axes = axes.reshape(-1)
for i in range(len(axes)):
    a = axes[i]
    # weights_ has shape (784, 10); column i holds the weights for digit i.
    a.imshow(weights.T[i].reshape(28, 28), cmap=plt.cm.seismic)
    a.set_title(i)
    a.set_xticks(()) # ticks be gone
    a.set_yticks(())
plt.show()
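
If weights_ is unavailable (it was deprecated around this release), the same matrix can be fetched by variable name instead. A sketch; the exact variable name varies by version, so list the names first:

print(classifier.get_variable_names())
# e.g. weights = classifier.get_variable_value('linear/weights')  # hypothetical name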